Dynamic Inverted Index Maintenance
نویسنده
چکیده
The majority of today’s IR systems base the IR task on two main processes: indexing and searching. There exists a special group of dynamic IR systems where both processes (indexing and searching) happen simultaneously; such a system discards obsolete information, simultaneously dealing with the insertion of new information, while still answering user queries. In these dynamic, time critical text document databases, it is often important to modify index structures quickly, as documents arrive. This paper presents a method for dynamization which may be used for this task. Experimental results show that the dynamization process is possible and that it guarantees the response time for the query operation and index actualization. Keywords— Search engine, inverted file, index management.
منابع مشابه
A Hybrid Approach to Index Maintenance in Dynamic Text Retrieval Systems
In-place and merge-based index maintenance are the two main competing strategies for on-line index construction in dynamic information retrieval systems based on inverted lists. Motivated by recent results for both strategies, we investigate possible combinations of in-place and merge-based index maintenance. We present a hybrid approach in which long posting lists are updated in-place, while s...
متن کاملInverted index maintenance strategy for flashSSDs: Revitalization of in-place index update strategy
An inverted index is a core data structure of Information Retrieval systems, especially in search engines. Since the search environments have become more dynamic, many on-line index maintenance strategies have been proposed. Previous strategies were designed for HDDs. Consequently, in order to avoid expensive random access cost, Merge-based strategies have been preferred to In-place index updat...
متن کاملFast Construction and Maintenance of the HYB Index
We show that a HYB index can be constructed twice as fast as an ordinary inverted index. As shown in a series of recent works, the HYB index enables very fast prefix searches, which in turn is the basis for fast processing of many other types of advanced queries, including autocompletion, faceted search, synonym search, errortolerant search etc. HYB can be viewed as a “half-inverted index“ and ...
متن کاملA social inverted index for social-tagging-based information retrieval
Keywords have played an important role not only for searchers who formulate a query, but also for search engines that index documents and evaluate the query. Recently, tags chosen by users to annotate web resources are gaining significance for improving information retrieval (IR) tasks, in that they can act as meaningful keywords bridging the gap between humans and machines. One critical aspect...
متن کاملParallel Inverted Indices for Large-Scale, Dynamic Digital Libraries
PARALLEL INVERTED INDEX FOR LARGE-SCALE, DYNAMIC DIGITAL LIBRARIES
متن کامل